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Abstract 

We study properties of multi-layered, interconnected networks from an ensemble perspec¬ 
tive, i.e. we analyze ensembles of multi-layer networks that share similar aggregate charac¬ 
teristics. Using a diffusive process that evolves on a multi-layer network, we analyze how 
the speed of diffusion depends on the aggregate characteristics of both intra- and inter-layer 
connectivity. Through a block-matrix model representing the distinct layers, we construct 
transition matrices of random walkers on multi-layer networks, and estimate expected prop¬ 
erties of multi-layer networks using a mean-field approach. In addition, we quantify and 
explore conditions on the link topology that allow to estimate the ensemble average by only 
considering aggregate statistics of the layers. Our approach can be used when only partial 
information is available, like it is usually the case for real-world multi-layer complex systems. 


1 Introduction 

Networks are often used to describe interactions among the elements of a complex system. But 
until recently, the standard assumption in the literature was that networks are isolated enti¬ 
ties and do not interact with other networks. Today we understand that this assumption is a 
rough simplification, since real networks usually have complex patterns of interaction with other 
networks. In order to study more realistic systems, network theory extended its perspective to 
account for these network to network interactions, and to investigate their influence on various 
processes of interest that may use the network topology as substrate mmmm- 

Networks consisting of multiple networks and the connections between them are called intercon¬ 
nected or multi-layer networks |5j. In a multi-layered network each individual layer contains a 
network that is different from the networks contained in other layers, and the layer interconnec¬ 
tivity refers to the fact that nodes in different layers can be connected to each other. Nevertheless, 
it is often possible to extend and apply methods developed for single-layer (isolated) networks to 
multi-layer networks, assuming that all layers and the connections between them are known pre¬ 
cisely. Unfortunately, when creating networks using relational data on real-world systems we are 
often confronted with situations where we lack information about the details of their multi-layer 
structure. In such situations, ensemble-based approaches allow us to reason about the expected 
properties of such networks, provided that we have access to aggregate statistics which can be 
used to define a statistical ensemble. 
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For instance, there are situations in which we are able to precisely map the topology within each 
layer individually, but we may not be able to obtain the detailed topology of connections across 
different layers. As an example, we may consider the topology of connections between users 
in different online social networks (OSNs). Such a system can be represented as a multi-layer 
network, where each layer represents the network of connections between users within one OSN. 
In addition, cross-layer connections are due to users which are members of multiple OSNs at 
the same time, and which can thus drive the dissemination of information across OSNs. Data on 
the network topology within particular OSNs are often readily available, however it is in general 
very difficult to identify accounts of the same user in different OSNs. 

Contrary to the situation described above, we may also consider situations in which detailed 
information on the topology of cross-layer links is available, while the detailed topology of con¬ 
nections within layers is not known. For example, there may be a rather small number of static 
links across layers, while the topology of links within layers is too large and too dynamic to allow 
for a detailed mapping. Again, in such a situation we may still have access to partial, aggregate 
information on the inter-layer connectivity (such as the number of nodes or the density of links) 
which we can use in order to reason about a multi-layer complex system. 

Both the above situations lead to multi-layer networks, and both require us to reason about a sys¬ 
tem with incomplete information. This problem can be addressed from a macroscopic perspective 
using statistical ensembles , and in this work we extend the ensemble perspective to multi-layer 
networks, where we have access to mere aggregate statistics either on links within or across lay¬ 
ers. Combining both detailed and aggregate information on the links in a multi-layer network, 
we first define a statistical ensemble, i.e. a probability space containing all network realizations 
that are consistent with available information. Secondly, we assume a probability mass function 
which assigns a probability to each possible realization in the ensemble. And finally, using either 
analytical or numerical techniques, we use the resulting probability space to reason about the 
expected properties of a network given that it is drawn from the ensemble. 

The rest of the manuscript is structured as follows. In Sect. [2] we present our methodological 
approach to model ensembles of multi-layer networks, we formally introduce the diffusion process 
that is assumed to run on the multi-layer network, and we introduce a method that allows to 
aggregate the statistics of links inside layers and across layers. In Sect. [3] we introduce a mean- 
field approach to approximate ensemble averages, and we investigate under which conditions 
it can be used to argue about diffusion in multi-layer networks. In particular we discuss three 
distinct cases according to different levels of information that we may have about the topology 
of links across the layers or inside the layers. 
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2 Methods and definitions 

In our analysis we investigate a diffusion process that evolves on a static multi-layer network. 
More precisely we focus on diffusion dynamics modeled by a random walk process. In the following 
we provide a short summary that highlights the important properties of this process. Note that 
these are facts already known from the theory of random walks on single layer networks 0119], 
but later on we will extend this framework to multi-layer networks. 

We assume a discrete time random walk process on a network G that consist of n nodes. Starting 
at an arbitrary node, at each step of the process the walker moves to an adjacent node, so that 
for a pair of nodes i,j the probability for a walker to move from node i to node j is given by the 
corresponding entry Tjj of a transition matrix T. Since we have JW P(i — > j ) = 1, the transition 
matrix is row stochastic. 

We further consider a vector 7r f E R n , whose entries 7r| indicate the probability of a random walker 
to visit node i after t steps of the process. Here, we consider ir° as a given initial distribution, 
whose entries tt® give the probability that the random walker has started at node i. The change 
of visitation probabilities n' —> ir t+1 can then be calculated based on the transition matrix as 
follows: 

t r t+1 = tt*T. (1) 

Since this is an iterative process starting with 7To, the visitation probability vector after t time 
steps can be calculated as it 1 = 7r°T*, and we can investigate the long-term behavior of the 
random walk process for t — > oo. For a visitation probability vector 7r* such that 7 t*T = v* , we 
can say that the process reaches a stationary distribution tt*, and if the transition matrix T is 
primitive, the Perron-Frobenius theorem guarantees that such a unique stationary distribution 
7r* exist. 

In order to assess the convergence time of a random walk process, we can study the total variation 
distance between visitation probabilities n* after t steps and the stationary distribution n*. For 
two distributions n and tt' , the total variation distance is defined according to Ref. [6] as 

A(^,7/) := ( 2 ) 

i 

where 7r ? ; indicates the i-th entry of tt. 

As a proxy for diffusion speed, we can now investigate how long it takes until the total variation 
distance /S.(rr t 1 TT*) falls below some given threshold value e (for a small e). In other words, we 
study how many steps t(e) a random walker needs such that A('7r < ,7r*) < e for t > t(e) 

The eigenvalues 1 = Ai > | A 2 1 > • • • > |A n | of a row-stochastic matrix necessarily have absolute 
values that fall between zero and one, while the largest eigenvalue Ai is necessarily one. The 
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number of required time steps t(e) (and thus the diffusion speed of the random walk process) 
can be estimated by means of the the second-largest eigenvalue A 2 of T, 

t(e) ~ InCM ' (3) 

For a detailed derivation see Ref. f?]. Eq. ([3| shows that a second-largest eigenvalue A 2 close to 
one implies slow convergence, while A 2 close to zero implies fast convergence. Therefore in the 
following we use the second-largest eigenvalue of a transition matrix A 2 (T) as a proxy to measure 
and quantify the convergence behavior on a network. 


Multi-layer network 


The purpose of our study is to investigate diffusion processes on ensembles of networks with 
multiple interconnected layers. Thus, in the following we briefly introduce the notion of multi¬ 
layer networks used in this manuscript. Let us consider a multi-layer network denoted by G that 
consist of L non-overlapping layers G\,... ,Gl- Each of these layers Gi is a single-layer network 
Gi = ( Vi,Ei ) where V(Gi ) and E(G{) denote the nodes and links of layer l respectively. We 
call the links E{G{) between nodes within the layers l intra- links. The multi-layer network G 
consist in total of n nodes, where n = |V(Gj)|. In addition, we assume a set Ej( G) of inter¬ 

layer links which connect nodes across layers, i.e. for each edge (u,v) E Ej we have u E V(Gi) 
and V(Gj ) for i ^ j. Inter-layer links induce a multipartite network with the independent sets 
Gr,..., G l . 

In our study we consider undirected and unweighted networks, however some of our results may 
hold even for directed or weighted networks. Furthermore, from the perspective of a random 
walk process, we assume that inter- and intra-layer links are indistinguishable, i.e. transitions 
are made purely randomly irrespective of the type of link. As such, the multi-layer network can 
also be viewed as a huge single network consisting of subgraphs G\,... ,Gl- 


As mentioned above diffusion dynamics on networks can be studied analytically using transition 
matrices of random walkers B !• The multi-layer structure of a network can explicitly be 
incorporated in a random walk model by constructing a so-called siipra-transition matrix mm 
similar to the supra-adjacency matrix used in m ma m- The supra-adjacency matrix of a 
multi-layer network G can be defined in a block-matrix structure as 



( 4 ) 
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On the diagonal we have the adjacency matrices corresponding to the layers 

G\,... ,Gl, thus entries of these block matrices represent the intra-layer links of the multi¬ 
layer network. Off-diagonal matrices A ij for i,j E {1 with i ^ j represent inter-layer 

links that connect nodes in layer Gi to nodes in layer Gj. Since we consider undirected networks 
we have A^- = Ajj. 


Based on a supra-adjacency matrix A we can easily define a supra-transition matrix T of a 
random walker on a multi-layer network G. In block-matrix form such a matrix can be written 
as: 



(5) 


Here, each entry T^j is defined as: 


rr — “u 

~ r » r, ’ 
1 a kj 


( 6 ) 


where aij are the corresponding entries of the supra-adjacency matrix A. Note that, due the 
presence of inter-layer links, block matrices T ^ are in general not equal to the row-normalized 
version of block matrices A ij . The supra transition matrix defined above can be used to model 
a random walk process on a multi-layer network. 


From an analytical perspective the supra-transition matrix can be treated in the same way as 
the transition matrix of a single layer as explained above. In the case of undirected networks 
the eigenvalues of a transition matrix are related to the eigenvalues of the normalized Laplacian 
matrix. In our case we study the second-largest eigenvalue of the supra-transition matrix and 
use it as a proxy for the efficiency of a network with respect to a diffusion process as pointed out 
above. 


Using T we are able to model a diffusion process on a multi-layer network. Since we especially 
want to emphasize the relevance of the inter-links, in the next section we introduce a transition 
matrix that only considers transitions across layers and not between individual nodes. As we will 
see later, this aggregated transition matrix is useful to distinguish the influence of inter-layer and 
intra-layer links on the convergence behavior of a random walk process. 


Multi-layer aggregation 

The supra-transition matrix T introduced previously contains transition probabilities for any 
pair of nodes in the multi-layer network. In this sense T could also be the transition matrix of 
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a large network, which is not divided in separate layers. In order to understand the effects of 
a layered structure, in this section we focus explicitly on transitions across layers. To do this 
we aggregate the statistics of inter-links and the intra-links of all single layers, and thus, we 
homogenize all individual nodes that belong to the same layer. This way we reduce the supra- 
transition matrix T of dimension n to an aggregated transition matrix X of dimension L. We 
call this process multi-layer aggregation and the matrix X the layer-aggregated or just aggregated 
transition matrix. Later on we will provide a relation between the eigenvalues of T and X, which 
will allow us to decompose the spectrum of T. This is important since the convergence behavior 
of a random walk process depends on the second largest eigenvalues of T. 

Let us begin by discussing the construction process of the layer-aggregated transition matrix. 
Our goal is to define transition probabilities across any two layers G s and Gt by averaging the 
transitions between any two nodes of G s and Gt.. Under certain conditions which will be specified 
in the following, these average transition probabilities can be representative for all nodes of the 
different layers. 

Let G be a multi-layer network that consists of L layers G±, ..., Gl- The transition probability 
to go from node Vi to any node Vj in G is defined as 

U^Vj) 

i f u j) — / \ 

lsk u \Vi,V k ) 

where u(vi,Vj) is the weight of a link connecting Vi with Vj. This is a general formalism, but 
since we only consider unweighted networks we have to(vi,Vj) = 1 if and only if there is a link 
between the nodes Vi and Vj. 

For each node Vi in layer G s we require that the transition probabilities P{vi —> *) to nodes in 
another layer Gt fulfill the following equation 


(7) 


a ss ^2 P(vi-^ Vj) = a st ^2 P{vi^>v k ) V^gU(G s ), (8) 

Vj£V(G s ) v k &V{G t ) 

where a s t is a factor that only depends on the layers G s and Gt- The factor is used to 
normalize the transitions, such that ^2 t a st = 1 is satisfied. In other words Eq. (|8]) implies that 
the probability for a random walker at node i to stay inside layer G s is a multiple of the probability 
to switch to layer Gt- 

We can see that a s t is independent of i, and therefore = a s tR S ( where R s t is a row stochastic 
matrix. This means that T s t resembles a scaled transition matrix, and a s t represents the weighted 
fraction of all links starting in G s that end up in Gt■ Thus, we can define the aggregation of a 
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supra-adjacency matrix satisfying Eq. (|8]) as 


/ Oil 1 


au 


UsL \ 






a s i 


&st 


OtsL 






\ OtLl 


OiLt 


cull J 


(9) 


If a multi-layer network G satisfies Eq. (|8j we can follow that the spectrum of the aggregated 
matrix X = {a st }st is 

S'pec(X) = {1, A 2 , ..., \l} , (10) 

and it holds that A 2 ,..., Xl £ Spec( T) (see Prop. [I] in the Appendix). 

This relation implies that the aggregated matrix T preserves L eigenvalues of the supra-transition 
matrix T, where L is the amount of layers. In other words, under the condition that Eq. (|8]) holds, 
we are able to make statements about the spectrum of the transition matrix T only using the 
layer-aggregated transition matrix X. 

Similar to the Fiedler vector, i.e. the eigenvector corresponding to the second smallest eigenvalue 
of the Laplacian matrix, here we may use the eigenvector V 2 corresponding to the second largest 
eigenvalue A 2 of the transition matrix T. The vector V 2 contains negative and positive entries 
and sums up to zero. If all individual nodes that belong to the same layer correspond to entries 
of V 2 with the same sign, we consider the layers of G partitioned according to V 2 , which is also 
called spectral partitioning or spectral bisection mm- In this case, according to Cor. [l] in the 
Appendix, it holds that A 2 (T) = A 2 (T). 

We note that the multi-layer aggregation, performed according to a spectral partitioning, has 
similarities to spectral coarse-graining dJ The multi-layer aggregation presented here decreases 
the state space as well, but still preserves parts of the spectrum. 

The spectral properties introduced in this section are important for our ensemble estimations 
that follows, since we characterize the diffusion process by its convergence efficiency measured 
through the second-largest eigenvalue A 2 (T) of the supra-transition matrix. However, as outlined 
before, if Eq. (|8]) holds then this eigenvalue is equal to the second-largest eigenvalue A 2 (X) of 
the aggregated transition matrix X. Considering that for the construction of X we only used 
aggregated statistics on the network and not the detailed topologies of the inter-links or any of 
the intra-links of all single layers, this already provides a hint how we can treat a system in the 
case of limited information. 


24 






















Nicolas Wider, Antonios Garas, Ingo Scholtes, Frank Schweitzer: 

An ensemble perspective on multi-layer networks 
To appear in: A. Garas (Ed.) Interconnected Multilayer Networks, Springer 2015 


3 Mean-field approximation of ensemble properties 

With the layer aggregation introduced in the previous section, we are now able to deal with multi¬ 
layer network ensembles in case of limited information. In our case, this information concerns 
knowledge either of the inter-link topology between layers or the intra-link topologies of all 
single layers. For our purpose we define ensembles based on the inter-link densities and intra- 
link densities of all single layers, more precisely, by using the amount of nodes, the amount of 
inter-links across any two layers, and the amount of intra-links of all single layers. The number 
of nodes in individual layers are represented by the vector n = {n-i,..., til} and the number of 
links between layers by a matrix M with entries m s t where s gives the source layer and t the 
target layer. Intra-layer links have both of their ends in the same layer and therefore we assume 
that the diagonal elements m ss are equal to the amount of desired intra-links multiplied by two. 
We denote the ensemble defined by these two quantities £(n, M). 

A single random realization of this ensemble satisfies the aggregated statistics given by M and 
n. We assume a random uniform distribution of links and therefore each realization of £(n, M) 
has the same probability. However, instead of single realizations we are rather interested in the 
average values of all possible realizations. For each multi-layer network realization G of £(n, M) 
we build the supra-transition matrix T, which defines a random walk process that is different for 
every realization. As discussed above, a proxy of the convergence quality of these random walk 
processes is given by the second-largest eigenvalue A 2 (T). Our goal is to estimate the average 
A 2 of the ensemble £{n, M), and we do this using a mean-held approach on the supra-transition 
matrix T that is similar to Refs. fT5l 1131 . 

Hereafter we will provide a mean-held approach for the general case, i.e. when the exact topology 
of inter-links and intra-links of all single layers are unknown. Next, building on this approach, 
we will discuss the case for which we have full knowledge of the intra-link topology but we do 
not know the inter-link topology, and the case for which we have full knowledge of the inter-link 
topology but we do not know the intra-link topology. 

Case I: unknown topology of inter-links and intra-links for all layers 

For this case we only assume knowledge of the ensemble parameters M and n. We define a 
mean-held adjacency matrix A with a block structure similar to Eq. ([ 4 ]) , and for each A s t we are 
only given the amount of links equal to m s t . Since we do not know how these links are assigned 
to the entries A s t, without loss of generality we assume a uniform distribution. Thus, for the 
blocks of A we have 

, i G {1, ... ,n s }, j <E {l,...,n t }. (11) 

l n s'n t J „■ 
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Following the discussion of Sect. [2j based on the mean-field adjacency matrix we define a mean- 
field transition matrix T. The transition probability between any two nodes i,j £ G s for a 
fixed layer s is the same since according to the available information individual nodes cannot 
be distinguished based on their connectivity. Further, the transition probabilities between any 
two nodes i £ G s and j £ Gt are the same for any two fixed layers s and t. Therefore, all block 
transition matrices T S £ contain the same value at each entry. Hence we have 


T.t = 


m st 


n t (Efe m sk) 


, i £{!,..., n s }, j £{!,..., n t }. 




Now, using Eq.(|8]) we can construct an aggregated supra-transition matrix T with entries 

m s t 


Ot-st — 


Xfc m sk 


( 12 ) 


(13) 


The aggregated supra-transition matrix T describes the macro behavior of the multi-layer network 
ignoring the detailed topology of the inter-links and the intra-links of all single layers. Since T 
depends on a mean-field approach it only captures probabilistic assumptions of the ensemble 
£(n, M). Thus, the spectrum of the mean-field supra-transition matrix T can be calculated by 


Spec( T) = Spec{ T) U 


U u ?=i l i°} 


\S= 1 


(14) 


To clarify the situation, let us briefly discuss the simple case of a network G that contains only 
two layers G\ and G 2 , for which we get 


1 - ( 1 ~ ° 12 012 y (15) 

\ «21 1 — «21 J 

Hence, for the mean-field matrix of a two-layered network we obtain 

Spec( T) = {1,1 - ai 2 - a 2 i, 0, ■ , 0 } . (16) 

|n|—2 times 

These results are remarkable, since the layer-aggregated transition matrix captures the same 
relevant eigenvalues as the mean-field transition matrix. So, for the case of a diffusion process 
in two layers the eigenvalue of interest is A 2 (T) = 1 — ai 2 ~ 021 - However, so far we only 
considered the general case where we can only use the densities of inter-links and intra-links 
of all single layers. In the following two sections we will investigate cases where we may have 
some additional information about either the inter-link topology between all single layers or the 
intra-link topology of all single layers. For simplicity, we will restrict ourselves to the two layer 
case but, as shown in the appendix, our results can be generalized to multiple layers. 
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Case II: unknown inter-connectivity 


For this case we assume full knowledge of the intra-link topology, i.e. we know exactly which 
nodes are connected in all of the single layers. But while we know the number of links between the 
layers we do not know how the layers are connected, i.e. we do not know the inter-link topology. 
With respect to the general case discussed previously, here we have more information which is 
expected to improve the estimates of the ensemble average. 

More precisely, we consider a two-layer network with unknown inter-link structure denoted by 
Ef(G), but with a given amount of m interconnecting links which connect the networks G\ 
and Cr 2 - This means that the diagonal blocks Ai and A 2 of the supra-adjacency matrix are 
given, but the off-diagonal blocks A 12 and A 21 can take any form such that they have exactly m 
entries different from zero. Since there are no further constraints on the ensemble, any random 
link configuration that consists of m inter-links has the same probability to occur. Therefore, we 
define the mean-field supra-adjacency blocks that correspond to the inter-links, A 12 and A 21 , to 
have the same value in each entry. 

For the supra-transition matrix we have to row-normalize Ai with Ai 2 and A 2 i with A 2 . The row 
sums of A 12 are all equal to m/n\ and the row sums of A 21 are all equal to m/ri 2 , while the row 
sums of Ai and A 2 correspond to the individual degrees of the nodes in G\ and G 2 respectively. 
Thus, we use the mean degree d\ of G\ and cfe of G 2 in order to obtain the row-normalized 
transition matrix T, and to define the following factors 

nidi U2<i2 m m . . 

Ot 1 — -j-: 0-2 — -j-, «12 — - ~ -, 021 — f - • (17) 

nidi + m n 2«2 + nn nidi + m n 2 a 2 + m 

Note that ai + a .12 = 1 and a 2 + « 2 i = 1- 

Accordingly we define the mean transition blocks of T 12 and T 21 . 



for i E {1,.. 

■,ni},j G {1,.. 

. ,n 2 } 

(18) 

for i E {1,.. 

■,nz},3 e {!,•• 


(19) 


This means that each of the off-diagonal block matrices that correspond to the mean-field inter¬ 
link structures have the same value at each matrix element, and the diagonal blocks are just 
rescaled transition matrices of Ai and A 2 , 


Ti = (1 — 0 : 12 ) T(Ai), T 2 = (1 — < 221 ) TXA 2 ), (20) 
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where T(M) is the row-normalized version of matrix M. We denote the supra-transition matrix 
with the blocks constructed as described before by T, 


T = 


Ti 

T12 

T21 

t 2 


( 21 ) 


This mean-held matrix has some special properties. First of all, the eigenvalues of Ti and T2 
are also eigenvalues of T. Further, the multi-layer aggregation of T is given by 

T _ ( OL\ Ot\2 ] _ [ 1 - “12 “12 

y a 21 «2 J y Ct21 1 - «21 

so, the second-largest eigenvalue of T is given by A2 = 1 — «i2 — <221 ■ 

The second-largest eigenvalues of Ti is equal to (1 — 0:12 )A 3 > and of T2 is equal to (1 — »2i)A2, 
where A 3 , = A2(T(Ai)) and A| = A2(T(A2)). Therefore the second largest eigenvalue of T, 
denoted by A2(T), fulhlls the following condition (See Prop. [2] in the Appendix for more details) 

A 2 (T) = max (l — ai2 — 021, (1 — <212)^2, (1 — £*21^2) ■ ( 23 ) 

We would like to remind the reader that an eigenvalue A2 close to one implies slow convergence 
and A2 close to zero fast convergence. From the above equation we can see that as long as 
A2 = 1 — ai2 — a21 is maximal the inter-links are the limiting factor of the convergence in the 
multi-layer network. This means that due to the inter-link topology the random walk diffusion 
is slowed down, and the influence of the intra-layer topologies is marginal to the process. 

When either the term of AJ, or A2 is maximal then the diffusion is limited by the single layer G\ 
or G 2, and the additional information provided by the intra-layer topologies becomes relevant 
as it affects the diffusion process. Note that the change between A2 and either \\ or A2 being 
maximal is related to the transitions pointed out in Ref. mm- 

This behavior is shown in Fig. [l] for the mean-held matrix of two interconnected networks. The 
figure shows the second largest eigenvalues of T, T and the sparsest layer Ti for different amount 
of inter-links. When only a few inter-links are present the interconnectivity between layers slows 
the process down, as it is expected. When we increase the amount of inter-links, we can reach the 
convergence rate of single layers, which is the point where the single layers slow down the process. 
However, with an increasing amount of inter-links the single layers lose their importance and the 
process is again slowed down by the inter-links. This happens because a very large amount of 
inter-links force the random walker to switch between layers with increasing probability, thus, 
preventing diffusion to reach the whole layer. To conclude, the mean-held transition matrix T 
is a better estimation than T in intermediate numbers of interlinks, which for our systems is in 
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Figure 1: Eigenvalues of a mean-field approach of a two-layered network. Layer 1 consists of 
an Erdos-Renyi network of 100 nodes and 500 links and Layer 2 consists of an Erdos-Renyi 
network of 100 nodes and 750 links. The x-axis indicates the amount of inter-links randomly 
added across the layers. The lines indicate the second-largest eigenvalue of: black dashed: the 
mean-field supra-transition matrix A 2 (T), violet: the layer-aggregated matrix A 2 (T), turquoise: 
the larger single layer eigenvalues of A 2 (Ti) and A 2 (T 2 ). 


the region of approximately 550 to 1800 inter-links. Otherwise, the information about the link 
densities as captured in T is enough to approximate the second-largest eigenvalue of T, and thus 
the speed of diffusion. 

In general the spectrum of a mean-field matrix T with unknown inter-link topology is given by 


Spec( T) = {1, A 2 ,..., A n } U 


(J Spec(T s ) \ Ar(T s ) I , 


(24) 


or 


Spec( T) 


Specie) U 


U^ecCTsAArCTs) , 




(25) 


where T is the multi-layer aggregation of T as described before (for details see Prop, [i] in the 
appendix). This decomposition of eigenvalues can also be useful for other network properties 
that depend on eigenvalues. 


So far we provided an estimation based on the eigenvalues of a mean-field transition matrix T 
that intends to approximate the ensemble average. In reality however, ensemble realizations of 
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Figure 2: a) Eigenvalues of a mean-field approach of a two-layered network. Layer 1 and 2 both 
consist of an Erdos-Renyi network of 50 nodes and 100 links but with different topologies. The 
x-axis indicates the amount of random inter-links added across layers. The lines indicate the 
second-largest eigenvalues of: black line: ensemble averages, turquoise line: mean-field estimate 
including intra-link topology, violet dashed line: mean-field only considering densities, b) Eigen¬ 
value difference between ensemble average and mean-field estimation AA 2 = A 2 (T) — A 2 (T). 


multi-layer networks that contain layer G\ and G 2 can deviate from the mean-field estimation. 
This is shown in Fig. [ 2 ] (a) where we plot the second-largest eigenvalues of T, ‘X, and ensemble 
averages over 100 realizations of T against the number of inter-links between G 1 and G 2 . As we 
can see, the magenta colored dashed line showing the mean-field approximation of X is a good 
proxy for the diffusion dynamics in the region when inter-links dominate, which is the case for 
either sparse or very dense inter-link topologies. However, as shown by the cyan colored line, we 
can actually improve this approximation if we additionally consider the intra-links of all single 
layers. 

There is a peak where the difference between the estimation and the ensemble averages AA 2 = 
A 2 (T)-A 2 (T) reaches high values up to 0.225, as shown in Fig. [ 2 ] (b). This happens, on one hand, 
due to the large degree of freedom that comes from the absence of intra-connectivity informations 
within the layers. On the other hand, the mean-field matrix assumes “full-connectivity” across 
layers, and even though this implies small weights for each single inter-link, it leads to a systematic 
bias towards overestimating the diffusion speed. Nevertheless, we would like to highlight that the 
multi-layer aggregation provides a quite accurate estimation of the diffusion speed in the regimes 
where inter-links limit diffusion. 
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Case III: Unknown intra-connectivity 


For this case we assume full knowledge of the inter-link topology, i.e. we know exactly how 
the layers are connected, but the intra-link topologies, i.e. how the nodes are connected within 
the single layers, are unknown. More precisely, we consider two interconnected layers G\ and 
G 2 of a multi-layer network, and we fix the inter-links Ej( G) in a bipartite network structure 
that connects nodes of G*i to nodes of G 2 - Since we have no information about the intra-link 
topologies of G\ and G 2 , we assume random connectivities within the layers, so that we only 
know the average degrees d\ and of G\ and G 2 respectively. This means that the off-diagonal 
blocks A 12 - A 2 i of the supra-adjacency matrix are given, but the diagonal blocks Ai and A 2 
are unknown. 

Because we only know the average degrees d\ and d 2 of the layers, we can define mean-held 
versions of the adjacency matrices such that 


Ai = 


Mil 

, . 1 

\d 2 \ 

) and A 2 = < 

W 

U- 1 

{ 112 1 


v 


However, even though we know the topology of the inter-links, we do not know which nodes 
exactly are connected to each other. Hence we use the same approach as in Case II with m equal 
to the amount of inter-links and the factors defined as in Eq. ([lT]) . Therefore we get the mean-held 
transition matrix T consisting of the following block matrices, 


Ti = 


To = 



for * € rn},j G n 2 } 


for i e n 2 },j 6 {1, — , n{\ . 


,n 2 } 

(26) 

,ni}. 

(27) 

A 21 , 



(28) 

■ eigenvalues A) 

and A|, 


T 12 = ai2T(Ai 2 ), T 2 i = a2iT(A 2 i 


as it was the case in Case II. In particular, depending on the ensemble constraints we could 
only compute an average eigenvalue A 2 for a single layer. Therefore, we can use the following 
maximization term 


A 2 (T) = max (l - ai 2 - a 2 i, (1 - 0112 )^ 2 , (1 - a 2 i)A^ , 


(29) 


which is has the same form as in Case II (see Eq.(23)). Here again, as long as A 2 = 1 — «i 2 — &21 
is maximal the inter-links are the limiting factor of diffusion in the multi-layer network, which 
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means that due to the inter-link topology the random walk diffusion is slowed down, and the 
influence of the intra-layer topologies is marginal to the process. On the other hand, when either 
the average term of A 2 or A 2 is maximal then the diffusion is limited by the single layer G 1 or 
G 2 , and the additional information provided by the intra-layer topologies becomes relevant as it 
affects the diffusion process. 

In Fig. § a), starting with initially empty intra-network^] we plot the second largest eigenvalues 
of T, T, and the ensemble average of 100 realizations of T against the number of intra-links that 
are simultaneously and randomly added in both layers. We observe that the general behavior is 
similar to Fig. [2] Thus, the multi-layer aggregation plotted in magenta approximates well the 
regions where the inter-links are the relevant factor, which is for very sparse and increasingly 
dense intra-links densities. The difference between the mean-field and the ensemble average 
AA 2 = A 2 (T) — A 2 (T) as seen in Figure [i^b) again rises up to a peak of about 0.225. 

Our analysis shows that there is some form of symmetry in knowing the degree of the nodes 
in the single layers, but not knowing how they are connected to nodes in other layers and to 
knowing the inter-links between layers, but not the degree of their adjacent nodes. Even though 
the ensembles generated from these two constraints can be much different, the relevance of inter¬ 
links or intra-links of all single layers to a diffusive process is comparable for both cases. 


4 Conclusion 

In this manuscript, we showed how an ensemble perspective can be applied to multi-layer net¬ 
works in order to address realistic scenarios when only limited information is available. More 
precisely, we focused on a diffusion process that runs on the multi-layer network and its relation 
to the spectrum of the supra-transition matrix. We have shown that the convergence rate of 
the diffusion process is limited by either the inter-links or intra-links of the single layers and we 
identified for which relation of inter-link compared to intra-link densities it is sufficient to only 
consider transitions across layers, instead of the full information on all individual nodes. This 
implies that we do not always need perfect information to make statements about a multi-layer 
network because, under certain conditions, we are still able to make analytical statements about 
the network only using partial information. In realistic situations data can be an issue either 
due to availability constraints or due to their vast amounts. In such cases, even though an exact 
analysis is impossible, we may still derive useful conclusions about processes that depend on the 
network spectrum (like diffusion and synchronization) using only aggregated statistics. 


1 Note that even though the intra-layer networks are empty initially, there is a number of inter-layer links which 
provide connectivity across the layers, similar to a bipartite network. 
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Figure 3: a) Eigenvalues of a mean-field approach of a tow-layered network with 250 inter-links. 
Layer 1 and 2 both consist of 50 nodes but no edges. The x-axis indicates the amount of links 
intra-links that are simultaneously added to both layers. The lines indicate the second-largest 
eigenvalue of: black line: ensemble averages, turquoise line: mean-field estimate including inter¬ 
links, dashed violet line: mean-field only considering link densities, b) Eigenvalue Difference 
between ensemble average and mean-field approach AA 2 = A 2 (T) — A 2 (T). 


For our study we assumed the simplest case of random networks, therefore exploring other ways to 
couple the network layers or including link-weights and directed links and testing their influence 
on our results is up to future investigation. 


Acknowledgements. N.W., A.G. and F.S. acknowledge support from the EU-FET project 
MULTIPLEX 317532. 


5 Appendix 

Note: Unless stated otherwise, here vectors are considered to be row-vectors and multiplication 
of vectors with matrices are left multiplications. 

We assume a multi-layer network G consisting of L layers G±,... ,Gl and n nodes. A single 
layer G s contains n s nodes and therefore Yls=\ n s = n - For a multi-layer network G we define 
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the supra-transition matrix that can be represented in block structure according to the layers: 


( T1 


Tn 


T sL \ 






T,! 


T st 


t sL 






l T ^l 


T L t 


t l ) 


Each T s t contains all the transition probabilities from nodes in G s to nodes in Gt■ Assuming 
Eq.(|8]) it follows that T s t = a S (R S ( where R s t is a row stochastic matrix. This means that all 
T s t are scaled transition matrices. The factor a s t represents the weighted fraction of all links 
starting in G s that end up in Gt. 

In this respect we define the aggregated transition matrix T of dimension L, 


( ot 11 


oiit 


a s L \ 






Ot 8 l 


Oist 


OtsL 






\ 


an 


&L ) 


Each vector v of dimension n can be split according to the layer-separation given by G, 


(30) 


Each component v^ has exactly dimension nk- We define the layer-aggregated vector V) = 
(tq,..., x>l) of dimension L as follows 


\/k e L} 


n k 

E K’ 


We use the bracket notation [n]* to represent the z-th entry of the vector v. Analogously, by 
[rM]j we mean the z-th entry Wi that represents the multiplication of v with a matrix M, i.e. 
w = uM. Further, by |u| we indicate the sum of the entries of v, |u| = Yli v i = 


Theorem 1. For a multi-layer network G consisting of L layers we assume the supra-transition 
matrix T to consist of block matrices T S f such that for all s, t G {1, ..., L}, T s t = a s tR S i where 
a s t G Q and R s * is a row stochastic transition matrix. The multi-layer aggregation is defined 
by T = {a s z} s z. If an eigenvalue A of the matrix T corresponds to an eigenvector v with a 
layer-aggregation n that satisfies 11> | ^ 0 then A is also an eigenvalue of T. 
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Proof. Assume v is a left eigenvector of T corresponding to the eigenvalue A. Therefore, it holds 
that uT = Xv. Let v^ be the k-th part of v that corresponds to the layer Gk■ We can write the 
matrix multiplication in block structure. 



Each v^ is a row vector which length is equal to the amount of nodes nj~ in Gk- The transfor¬ 
mation v^Tik is also a row vector with the same length as v^ k \ According to the eigenvalue 
equation it holds that for all k £ {1,..., L} 

\v (k) = (vT) {k) = JV')T i k . 

i 

Now let us denote the sum of the vector entries of v^ as 

■‘AM, 

i 

Further, we define layer-aggregated vector consisting of this sums by t> = (tq,..., 0 n ). Note 
that for a general row stochastic matrix M and its multiplication with an eigenvector v to the 
eigenvalue A it holds that A Y2ji v \j = 'Yhj [^M] j■ For the components after multiplication with T 
we can deduce 


£[< 


vT 


,(fc) 


£ 


i l l 


Y v{l)T j* 


£ 


i L l 


Y aikV^R-ik 


YY [ w ( 0 r m ]. = Y aik [ 


,(0 


l i l 

If we multiply 0 with T and look at a single entry of bT we get 


Yj • 

l 


Hence it holds that 


and therefore 


= y = Y Viaik ■ 

l l 


Wk = Y [(^ T ) (fe) 

i 
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Finally since T is row stochastic and \v = t;T we have that 

Ah = A t> n ) 


EK’],,-,E[w 


(l) 


= t)T. 

Therefore, A is also an eigenvalue of T to the eigenvector t) defined as before. It is important to 
note that this only holds if |t)| ^ 0. □ 


The procedure used in the proof of the previous theorem applies to several eigenvalues of T but 
at most L of them. Next we give a proposition for the reversed statement of Thm [lj 

Proposition 1. Let G be a multi-layer network that consists of L layers and fulfills all of the 
conditions of T/im [7J Let T = {a s t} s j be the multi-layer aggregation of T. If A is an eigenvalue 
of‘I then A is also an eigenvalue of T. 


Proof Assume that A is an eigenvalue of T. For each matrix there exist a left and right eigenvector 
that correspond to the same eigenvalue A. Assume the tu is the right eigenvector and therefore a 
column vector. Hence ‘Xtn = Ain and 


T 


Tro = ( "jP aijtoj, otkjXOj, ..., ^ a Gto.j = Arc 


(31) 


Now we generate a column vector w of dimension n such that for all the layer components w^ 
it holds that 

P fc) = (ttifc,... ,tt>fc) T . (32) 

Next we perform a right multiplication of w with T, 


T 


Tw = ( £ Tu«;W T kl w (l \ T liw 


(0 


T 


r, a u^-u w> ' i \ ■ ■ ■ > ^2 • • •, ^2 a LiRuw 


(0 
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Since all R s * are row stochastic matrices and contains only the value t xy for each entry we 
get R s tw^ = vj( 1 \ It follows that 

Tw = ('yi a u w ®’ ■ • ■ ■ ■, 'Y] agw^ 

V i i i ) 

= A w. 

And therefore A is also an eigenvalue of T. □ 


In the case of a diffusion process we are especially interested in the second-largest eigenvalue 
of T, denoted by A 2 (T), which is related to algebraic connectivity of T. In this perspective the 
following corollary is useful: 

Corollary 1. Let G be a multi-layer network consisting of L layers that fulfill all of the conditions 
of Thm\ 1} Further assume that G is partitioned according to a spectral partitioning, i.e. according 
to the eigenvector corresponding to A 2 (T), then A 2 (T) = A 2 (T). 

Proof. In general all the eigenvectors of a transition matrix, except the eigenvector corresponding 
to the largest eigenvalue that is equal to one, sum up to zero. However, these eigenvectors consist 
of positive and negative entries that allow for a spectral partitioning. Especially the eigenvector 
V 2 that corresponds to the second-largest eigenvalue A 2 (T), can be used for the partitioning 
of the network. This eigenvector is related to the Fiedler vector that is also used for spectral 
bisection m Therefore if the layer-partition of G coincides with this spectral partitioning we 
assure that the layer-aggregated vector of t >2 satisfies 11>21 d 0. Considering this and Prop [I] the 
corollary follows directly from Thm [l] □ 

Given Eq.([8]) we can fully describe the spectrum of T based on the intra-layers transition blocks 
Tj for i E {1,..., n} and the spectrum of T. Note that with uniform columns of a matrix M we 
mean that each column of M contains the same value in each entry. However, this value can be 
different for different columns. 

Proposition 2. Let T be the supra-transition matrix of a multi-layer network G that consist of 
L layers and satisfies Eq. |5|). If T has off-diagonal block matrices Tst, for s,t E {1,..., n} and 
s d t, that all have uniform columns, then the spectrum of T can be decomposed as 

S P ec( T) = {1, A 2 , • • •, A l } U f (J Spec{T s ) \ (Ai(T s )} j , (33) 

where T s are the block matrices of T corresponding to the single layers G s and Ai(T s ) the largest 
eigenvalue of T s . The eigenvalues A 2 , • • ■, A l ar e attributed to the interconnectivity of layers. 
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Proof. To prove this statement we just have to show that all eigenvalues (except the largest 
one) of T s for s € {1 are also eigenvalues of T. We assume that A is any eigenvalue 

corresponding to the eigenvector u of some block matrix T r , i.e. Xu = uT r . We define a row 
vector v that is zero everywhere except at the position where it corresponds to T r . The vector v 
looks like v = (0,..., 0, u, 0,..., 0). Now we investigate what happens if we multiply this vector 
with the transition matrix T. 



Let us take a look at the effect of the matrix multiplication on an arbitrary component v^ with 
k ^ r and recall that v^ is equal to a zero vector 0 for k ^ r. 

(p T )W = = £ OT/fc + uT rk = uT rk . 

I 

Note that all eigenvectors u of a transition matrix that are not related to the largest eigenvalue 
sum up to zero. Therefore it holds that uT rk = 0 since T rk has uniform columns and therefore 
uT rk yields in a vector where each entry is equal to some multiple of |it|. In case of k = r it 
holds that v^ = u and we get 

(t-T) W v {l) T lr = 0T ik + uT rr = uT r = A r . 

I Z,Z^7* 

Hence, it holds that cT = Xv, which means that A is also an eigenvalue of T. This way we get 
n — L eigenvalues of T apart from the largest eigenvalue that is equal to one. The remaining 
eigenvalues denoted by A 2 , ■ ■ •, A l are not attributed to any block matrix of T. Therefore they 
are considered to be the interconnectivity eigenvalues. □ 

Corollary 2. Let G be a multi-layer network consisting of L layers that satisfies Eq. and the 
conditions of Prop [1| Then the aggregated matrix T = {a s t}st has spectrum 

Spe.c(T) = {1,A 2 ,...,Al}, 

and it holds that A 2 ,..., A l £ Spec( T). 

Proof. Note that every eigenvalue A 7 ^ 1 of some block matrix T r with Xu = uT r is by Prop [2] 
also an eigenvalue of T. Furthermore, A is attributed to the eigenvector v = (0,..., 0, u, 0,..., 0) 
of T. However |u| = 0 since u is an eigenvector of a transition matrix, not related to the largest 
eigenvalue, and therefore sums up to zero. Hence all eigenvalues fulfilling this condition are by 
Thm[l]not eigenvalues of T. Since T contains at least L eigenvalues that by Prop[l]also correspond 
to eigenvalues of T, the remaining eigenvalues A 2 ,..., A l have to also be eigenvalues of T. □ 
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In the following we provide a useful proposition for the eigenvalues arising from the inter-links 
in case of two layers. Note that by the function T(-) applied to matrix M we indicate that T(M) 
is the row-normalization of M. 

Proposition 3. Let G be a multi-layer network that satisfies Eq. |$|), consisting of two networks 
G i and G 2 in separate layers. Assume that the supra-transition matrix T has the form 



Ti 

T 12 ) 

= 

( m Ai) 

(1-/3)T{ 2 \ 

T 21 

t 2 

V (1-/5)T 7 21 

fiT( A 2 ) ) 


where is the transition matrix of the layer G that only consists of the inter-layer links and 
(3 £ Q is a constant. Furthermore, assume that Ti and T 2 have uniform columns. 

Then for A £ Spec( T 1 ) and A 1, —1 it holds that (1 — j3) A £ Spec(T). 

Proof. If v is an eigenvector to the eigenvalue A 1, —1 of it holds that vT 1 = An. Hence, 
vT 1 = (u ( 1 ) ,u (2) ) T 1 = (V 2 )Ty u (1) T{ 2 ,) = A (u ( 1 ) ,u (2) ) . 

Because At/ 2 ) = t/ 1 )T{ 2 , we get A 2 ?/ 1 ) = t/ 1 )T{ 2 T 21 . Therefore, t/ 1 ) is also an eigenvector of the 
transition matrix T{ 2 T 21 to the eigenvalue A 2 . Note that A 1, —1 hence A 2 < 1 which implies 
that t/ 1 ) does not correspond to the largest eigenvalue and therefore its entries sum up to zero. 
The same holds for t/ 2 ) and the matrix T 21 T{ 2 . For the multiplication of v with T we deduce 
that 

uT = (w^y 2 )) T = (v < ^ 1) T l + (l- fi)v^T{ 2 + v^T 2 ^ . 

Since Ti and T 2 have uniform columns we get t/^Ti = 0 and ?/ 2 )T 2 = 0. And therefore 
vT = (1 - p)\v and (1 - fi)X G Spec( T). □ 

Proposition |3] can be extended to multiple layers, however the proof is more involved and will be 
omitted. 
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